Preliminary Chinese Term Classification for Ontology Construction

Authors

  • Gaoying Cui
  • Qin Lu
  • Wenjie Li
Abstract

An ontology can be seen as a representation of the concepts in a specific domain, so ontology construction can be regarded as the process of organizing these concepts. If the terms used to label the concepts are classified before an ontology is built, construction becomes much easier. Part-of-speech (PoS) tags carry linguistic information about terms, so PoS tagging can serve as a preliminary classification that helps construct concept nodes in an ontology, because the features or attributes related to concepts may differ across PoS types. This paper presents a simple approach to tagging domain terms for ontology construction, referred to as Term PoS (TPoS) tagging. The approach uses the segmentation and tagging results of general-purpose PoS tagging software to predict tags for extracted domain-specific terms. It needs no training and no context information. Experimental results show that the approach achieves a precision of 95.41% on extracted terms and can be easily applied to different domains. Compared with some existing approaches, it shows that for some specific tasks a simple method can achieve very good performance and is thus a better choice.
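
The idea of reusing a general tagger's output can be illustrated with a minimal sketch. This is not the authors' published rule set, which the abstract does not spell out. It assumes that a general-purpose tool has already split a term into (segment, tag) pairs, and it assumes a head-final heuristic in which the term inherits the tag of its last segment. The function name `predict_term_tag`, the sample segmentations, and the tag labels (`n`, `v`) are all hypothetical.

```python
from typing import List, Tuple

# A term as returned by some general-purpose segmenter/tagger:
# a list of (segment, PoS tag) pairs. The tag labels are hypothetical.
TaggedSegments = List[Tuple[str, str]]


def predict_term_tag(tagged_segments: TaggedSegments) -> str:
    """Predict a single PoS tag for a whole domain term.

    Assumed heuristic (not taken from the paper): a term that the
    general tagger keeps as one segment retains that segment's tag;
    a multi-segment term takes the tag of its last segment.
    No training data and no sentence context are used.
    """
    if not tagged_segments:
        raise ValueError("term has no segments")
    # For a single-segment term the last segment is the only segment.
    _, last_tag = tagged_segments[-1]
    return last_tag


# Hypothetical tagger output for two domain terms.
single = [("算法", "n")]                  # one segment, tagged as noun
compound = [("数据", "n"), ("挖掘", "v")]  # two segments, noun + verb

print(predict_term_tag(single))
print(predict_term_tag(compound))
```

Any real rule set would need to be checked against the tag inventory of the tagger actually used; the last-segment rule is only one plausible way to turn segment tags into a term tag.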


Similar resources

Chinese Core Ontology Construction from a Bilingual Term Bank

A core ontology is a mid-level ontology which bridges the gap between an upper ontology and a domain ontology. Automatic Chinese core ontology construction can help quickly model domain knowledge. A graph based core ontology construction algorithm (COCA) is proposed to automatically construct a core ontology from an English-Chinese bilingual term bank. This algorithm computes the mapping streng...


The Design and Implementation of Chinese Semantic Search Engine Based on FAQ Corpus and Ontology Construction from Information Extraction

Wen-Chih Chen, Lu-Ping Chang and Shi-Jim Yen. Advanced e-Commerce Technology Lab., Institute for Information Industry, ROC; National DongHwa University, Taiwan. E-mail: {wjchen, clp}@iii.org.tw. Abstract: In this paper, we propose FAQ corpus and ontology construction to implement a Chinese semantic search engine. These frequentl...


When Conset Meets Synset: A Preliminary Survey of an Ontological Lexical Resource Based on Chinese Characters

This paper describes an ongoing project concerned with an ontological lexical resource based on the abundant conceptual information grounded in Chinese characters. The ultimate goal of this project is to construct a cognitively sound and computationally effective character-grounded, machine-understandable resource. Philosophically, the Chinese ideogram has its ontological status, but its appli...


An Ontology-Based Method for Extracting and Classifying Domain-Specific Compositional Nominal Compounds

In this paper, we present our preliminary study on an ontology-based method to extract and classify compositional nominal compounds in specific domains of knowledge. This method is based on the assumption that, applying a conceptual model to represent knowledge domain, it is possible to improve the extraction and classification of lexicon occurrences for that domain in a semi-automatic way. We ...


A Methodology for Domain Ontology Construction Based on Chinese Technology Documents

Ontology is considered to play one of the most important roles in knowledge sharing and reuse. However, constructing a Chinese domain ontology effectively is a difficult problem. This paper proposes a pattern-learning approach to Chinese domain ontology construction, based on the fixed and simple character of Chinese syntactic patterns in technology documents. The first step of this method i...




Publication date: 2008